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Abstract 

We consider n agents located on the vertices of a connected graph. Each agent v receives a 
signal Xy{0) ^ N{fi, 1) where fi is an unknown quantity. A natural iterative way of estimating 
/Li is to perform the following procedure. At iteration t + 1 let Xy{t + 1) be the average of A„(t) 
and of Xii;{t) among all the neighbors w oi v. It is well known that this procedure converges to 
X{oo) — ^\E\^^ J2 dyXy where dy is the degree of v. 

In this paper we consider a variant of simple iterative averaging, which models "greedy" 
behavior of the agents. At iteration t, each agent v declares the value of its estimator A„(i) to 
all of its neighbors. Then, it updates Xy(t + 1) by taking the maximum likelihood (or minimum 
variance) estimator of /i, given Xy{t) and Xw{t) for all neighbors w of v, and the structure of 
the graph. 

We give an explicit efficient procedure for calculating Xy{t), study the convergence of the 
process as t — > cx) and show that if the limit exists then A„(oo) = Au,(cx)) for all v and w. 
For graphs that are symmetric under actions of transitive groups, we show that the process is 
efEcient. Finally, we show that the greedy process is in some cases more efficient than simple 
averaging, while in other cases the converse is true, so that, in this model, "greed" of the 
individual agents may or may not have an adverse affect on the outcome. 

The model discussed here may be viewed as the Maximum-Likelihood version of models 
studied in Bayesian Economics. The ML variant is more accessible and allows in particular to 
show the significance of symmetry in the efficiency of estimators using networks of agents. 
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1 Introduction 



Networks and graphs are often viewed as computational models. In computational complexity 
several complexity classes are studied in terms of corresponding computation graphs, for example 
finite-automata, PSPACE and LOG-SPACE. For general background see, e.g., [1]. In parallel 
computing, networks are used to model the communication network between different computers, 
while in sparse sensing the connectivity network is of fundamental computation significance (see, 
e.g., [2] and [9]). 

A recent trend emanating from Economics and Game Theory considers networks where different 
nodes correspond to computational entities with different objectives [5]. Recent models in Bayesian 
Economics consider models where each player is repeatedly taking actions that are based on a signal 
he has received that is correlated with the state of the word and past actions of his neighbors ([8], 

In this paper we study a simple model where, in each iteration, agents iteratively try to optimally 
estimate the state of the world, which is a single parameter G M. It is assumed that originally 
each agent receives an independent sample from a normal distribution with mean fj,. Later at each 
iteration each agent updates his estimate by taking the maximum likelihood estimator of /i given 
its current estimator and those of its neighbors, and given the graph structure. Note that for normal 
distributions, the maximum likelihood estimator is identical with the minimum variance unbiased 
estimator. At the first iteration, the estimator at each node will be the average of the original 
signal at the node and its neighbors. However, from the second iteration on, the procedure will 
not proceed by simple averaging due to the correlation between the estimators at adjacent nodes. 
As we show below, this correlation can be calculated given the structure of the graph and results 
in dramatic differences from the simple averaging process. Note that under this model, the agents 
are memory less and use only the results of the last iteration to calculate those the next. 

The model suggested above raises a few basic questions: 

• Is the process above well defined? 

• Can the estimators be efficiently calculated? Note that in the Bayesian economic models 
(such as [7J) there are no efficient algorithms for updating beliefs. 

• Does the process converge? 

We answer the first two questions positively, and conjecture that the answer to the third is positive 
as well. Once these questions are addressed we prove a number of results regarding the limit 
estimators including: 

• We show that for connected graphs, as t ^ oo, the correlation between the estimators of the 
different agents goes to one. 

• We describe a graph for which the maximum likelihood process converges to an estimator 
different than the optimal. 

• We compare the statistical efficiency of the limiting estimator to the limiting estimator ob- 
tained by simple iterative averaging and to the optimal estimator, in different graphs. 

1.1 Formal Definition of the Model 

We consider a finite, undirected, connected graph G = (y,E), where each vertex has a self-loop so 
that \/v : {v, v) £ E, and a state of the world /i G M. We assign each vertex v a normal unbiased 
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estimator = Xi,(0) of /x so that E[X^] = /x and Var[Xi,] = 1, for all v. These estimators are 
uncor related. 

In iteration t G N we define Xy{t + 1) to be the minimum variance unbiased estimator con- 
structible over the estimators of v and its neighbors N(v) = {w\{v^w) G at time t 

X^{t + l)= a^X^it), where: (1) 

a^, = 1, and a minimizes Var[^^ a^X^(f)]. (2) 

weN{v) w 

(note that a may be positive or negative). The process is given by simple iterative averaging so 
y;;(0) := X^, and 

Y,{t + l) = j- ^ YUt). (3) 

^ weN{v) 

It is well known that converges 

Finally, we define Z{oo) to be the global minimum variance unbiased estimator: 



Note that in the case of normally distributed X^^s, the minimum variance definitions coincide with 
those of maximum likelihood. 

This scheme can be generalized to the case where the original estimators X^ have a general 
covariance structure, with the definitions for and Yy{t) remaining essentially the same, and 

that of Z(oo) changing to the form of Eq. H] below. 

1.2 Statements of the main results 

• The process defined by Eqs. [I]and[2]is well defined. More formally: 

Proposition (j2.ip . For every realization of the random variables Xy{0),v € V and for all 

t >1, X^{t) is uniquely determined. 

• The process can be calculated efficiently: 

Proposition ()2.2p . It is possible to calculate {Xtj{t)\v £ V}, given {Xv{t — l)\v £ V}, by 
performing n operations of finding the point of an n dimensional affine space ( as specified by 
a generating set of size at most n ) with minimal L2 norm. 

Calculating the latter is a classical convex optimization problem. See, e.g., [Ij. 

• For transitive graphs the process always converges to the optimal estimator: 

Proposition (j2.7p . Let G be a transitive graph (defined below). Then Xy{t) converges to 
X{qc) = Z{oo). 

• For graphs of large maximal degree, Xy(t) converge to /x: 

Proposition (13. ip . LetGn = {Vn,En) be a family of graphs where \Vn\ = n anc?maxt,gy^ d^ — > 
00 . Then 

lim sup lim E[(X^(t) - ^f] = 0. 

n^oo ^gy^ t^oo 
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Note by comparison that for any graph, 

B[{Y{^)-^,)'] = l\E\-'Y^dl 

In particular for a star on n vertices, as n — > oo it holds that X{oo) converges to /i but y(oo) 
does not. 

• Finally, for some graphs, the process converges to a limit different than Z(oo) and y(oo). 

Theorem (jA.ip . Let G = {V,E) be the interval of length four where V = {a,b,c,d} and 
E = {{a,b},{b,c},{c,d}} . Then Xy{t) converges to a limit X{oo), where 

X(oo) = i [(1 - OiXa + Xd) + (1 + OiXb + Xc)] , Var[X(oo)] = e, 
with£, = 2-V3 = i(l + \/49 - a/48). 
Note that for this graph 

y(cx)) = ^ ((1 - l){Xa + Xd) + (1 + l){Xt + X,)) , Var[y(oo)] = 0.26, 

and 

Z{oo) = ^{Xa + Xb + Xc + Xd) , Var[Z(oo)] = \. 

1.2.1 Conjectures 

Showing some supporting results, we conjecture that the process always converges, and in particular 
to a state where all agents have the same estimator. 

We present a number of additional open problems and conjectures in the conclusion. 

2 General Proofs 

2.1 Process is Well Defined 

Proposition 2.1. For every realization of the random variables X^(0),v E V and for all t > 1, 
X^(t) is uniquely determined. 

Proof. Let xi^\t) = A and Xy'^\t) = B he minimum variance estimators satisfying Eqs. [1] and 
[21 with variance V. Then their average must have variance at least V, since it also is a linear 
combination of the estimators from which A and B were constructed: 

V < Var[^{A + B)] 

V < iVarA + iVarB + iCov(A,5) 

V < + iCov(yl,S) 

V < Cov{A,B). 

Since Cov{A,B) < VVarAVarB = V, then Cov{A,B) = V and A = B. Therefore, there 
exists a unique minimum variance unbiased estimator and the process is well defined. □ 
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2.2 The Algorithm for Calculating the Estimator 

We present an efficient algorithm to calculate Xy{t). Let E^{t) = {Xyj{t)\w £ N{v)} be the 
estimators of agent v's neighbors at time t. Let C be the covariance matrix of Ey(t), so that 
Cwu = Cov{Xiu(t), Xu{t)). For each w, let be a realization of X^it). Then the log likelihood 
of y G M is 

log>C(y) = - "^{x^ - y)C;^^{xu - y) + const, 

wu 

where is C's pseudo-inverse, this expression is maximal for 

_ X^WM ^WU-^W 

l^wu 



wu 



Hence, the MLE, and therefore also + 1), equals 

XML-Xy[t+i)- — — I . (4j 

l^w,u&N{v) ^wu 

Note that + is also, among all the unbiased estimators of /u constructible over the estimators 
in Ev{t), the one with the minimum variance. 

Given this last observation, there exists a simple geometric interpretation for Eq. [H 

Proposition 2.2. It is possible to calculate {Xy{t)\v S V}, given {X^{t — l)\v E V}, by performing 
n operations of finding the point of an n dimensional affine space ( as specified by a generating set 
of size at most n ) with minimal L2 norm. 

Proof. Consider an n-dimensional vector space V over R, with an inner product (•,•). Let z be 
some non-zero vector in V, and let ^ C V be the affine space defined by ^ = {x G V| (x, z) = 1}. 

(This is a generalization of V = span({Xt,|f G V}), {X,Y) = Cov(X, y), z = YIvgv ^^'^ 
being the set of unbiased estimators). 

Given a set of vectors E = {xk\k = 1, . . . , K < n}, where G A, let C^i = (xfc,Xi). Then 
A n span(£') is also an affine space, and the minimum L2 norm vector in ^ R span(£') is 

^ML = 1 , 

l^kl ^kl 

where is the matrix pseudo- inverse of C. This equation is identical to Eq. SI □ 

Note that if C is invertible then its pseudo-inverse is equal to its inverse. Otherwise, there are 
many linear combinations of the vectors in E which are equal to the unique :x.ml- The role of the 
pseudo-inverse is to facilitate computation: it provides the linear combination with least sum of 
squares of the coefficients [6]. 

2.3 Convergence 

Denote V^{t) := Var[X^(t)] and C^,^(t) := Cov(X„(t), 

Lemma 2.3. All estimators have the same limiting variance: 3p^\/v : Vv{t) ^ p^o 
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Proof. Since, in every iteration, each agent calculates the minimum variance unbiased estimator 
over those of his neighbors and its own, then the variance of the estimator it calculates must be 
lower than that of any other unbiased linear combination: 



w£N(v) 



1 ^ Vy{t + 1) < Var 



weN{v) 



In particular, for each neighbor w oi v 

Vvit + 1) < vut), 

and since each vertex is its own neighbor, then 

V^{t + 1) < V^{t). 



(5) 



(6) 



(7) 



Therefore, since the variance of each agent's estimator is monotonously decreasing (and positive), 
it must converge to some pv Now assume {v,w) £ E and p-u < Pw, then, at some iteration t, 
Vv{t) < Pw ^ Vwit). But then, by Eq. [6l we have Vw{t + 1) < < pw - ^ contradiction. 

Therefore, p^ must equal pw, and since the graph is connected, all agents must converge to the 
same variance, poo- □ 

Lemma 2.4. \/v,w : Cvw{t) poo 

Proof. The previous lemma is a special case of this one, for when v = w. Otherwise, for two 
neighboring agents v and w, for any e, there exists an iteration t where both Var[X„(t)] < p^^ + e 
and Var[X^(t)] < poo + e. Then: 

Var[i(X4t) + X^m = i [V,{t) + VUt) + 2a^(t)] <^[poo + e + C^Ut)] 

and since Eq. [5] implies Vu{t + 1) < Var[i(X„(t) + X^it))], then poo < ^ [Poo + e + C^wit)] and 
Cvw{t) > Poo — £• Since Cvw is also bounded from above: Cvw{t) < y^Var[Xt,(t)] Var[X^(t)] < 
Poo + E) we have demonstrated that Ct,^(i) — > poo when v and w are neighbors. This implies that the 
correlation between neighbors converges to 1, and therefore, since the graph is finite, all correlations 
converge to 1 and all covariances converge to poo- Q 

This last lemma implies that if one agent's estimator converges, then all others' also converge, 
to the same limit. Even without convergence, however, it implies that all the estimators converge 
to their average: 



lim Var 

t— >oo 



0. 



(8) 



The following lemma will be used to conjecture that all the estimators do converge. It states 
that an estimator is uncorrelated to the difference between it and any of the estimators which were 
used to calculate it. 

Lemma 2.5. \fw e N{v) : Cov(X^(t + 1), + 1) - = 

Proof. We examine the estimators X{P) = Xv{t + 1)(1 — /5) + Xw{t)(3, which are also unbiased 
estimators of p, and are linear combinations of the estimators from which X^{t+1) was constructed. 
They should all therefore have higher variance than X^{t + 1). Since X{p = 0) = Xy{t + 1), then 







dVar[X] 

5/3 



(3=0 
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Now: 



dVar[X] 

dl3 (3=0 

d [(1 - f3fVar[X,{t + 1)] + P^Yar[X^{t)] + 2/3(1 - /3)Cov(X,(t + l),X^{t))] 



dp p=o 
-2(1 - P)Var[X^{t + 1)] + 2/3Var[X^(t)] + 2(1 - 2/3)Cov(X„(t + l),X^{t))] 

-2Var[X4t + 1)] + 2Co^r{X^{t + l),X^{t)) 
-2Cov {X^{t + l),X^{t + l)- X^{t)) 



(3=0 



and so Gov (X„(t + 1), X„(t + 1) - X^(t)) = 0. □ 

Note that this imphes that Var[X^{t + 1)] = Cov{X^{t + l),Xy,{t)). 
Conjecture 2.6. (Convergence) 3X{oo)\/v : Xv{t) X{oo). 

The fohowing observation supports this conjecture: 



Yar[X,{t + l)-X,{t)] = Cov {X,{t + 1) - X,{t), X,{t + 1) - X,(t)) 

= Cov {X^{t + l),X^{t + l)- X^{t)) - Cov iX^{t),X4t + 1) - X^{t)) 



Using Lemma 



-Cov(X,(t),X,(t + l)-X,(t)) 
V,{t)-Cov{X,{t+l),X,{t)), 



and using it again: 



= K(t)-K(t + l). 

This implies that if to is such that K,(to) = Poo + e and therefore X^^j^ Vu{t) — Vu{t + 1) = e, then 

oo 

^Var[X,(t + l)-X,(t)] =e. 

t=to 



2.4 Efficiency for Transitive Graphs 

Vertex transitive graphs (henceforth referred to as transitive graphs), are graphs where all vertices 
are essentially equivalent, or "equally important". Alternatively, one may say that the graph "looks 
the same" from all vertices. Formally, G = {V, E) is transitive iff, for every pair of vertices VjW £ V 
there exists a function f : V ^ V which is a graph automorphism (i.e. / is a bijection and 
(a, b) £ E 4^ ifi(^)j f{b)) S E) and maps v to w. 

Proposition 2.7. When G is transitive then the process converges and X{oo) = Z{oo). 

Proof. By the symmetry of the graph, the average of the agents' estimators cannot give more weight 
to one agent's original estimator than to another: 

-^^X,(t) = -^^X, = Z(oo), 
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and hence the average of the agents' estimators is constant and in particular converges. By 
lemma 12.41 (Eq. [8|) , if the average converges then each of the estimators converges to the same 
limit: 




□ 

Note that for regular graphs (i.e. graphs where all vertices have the same degree), which are a 
superset of transitive graphs, y(oo) = Z{oo). 



3 Analytic Examples 

Complete analytical analysis of these iterations for general graphs seems difficult, since Eq. U] is 
quadratic. In fact, we found only two simple examples amenable to complete analysis: The star, a 
graph with a central node connected to all other nodes, and the interval of length four, a graph of 
four linearly ordered nodes. 

In the former, we show that the minimum variance scheme is efficient, so that X(oo) = Z{oo). 
In the latter, we show that it isn't, but that the "price of anarchy" is low. 



3.1 High degree graphs and the star 

We consider a graph of n vertices, of which u is the central node and is connected to all others, 
and no additional edges exist. 

The averaging estimator Y{oo) gives weight ^^_2 to X-^ and ^^_2 to the rest. Its variance is 
"(3n-2)"^'^ , which is asymptotically |. 

On the other hand, ^u(l) = Z{oo), since node u, neighboring all nodes of the graph, im- 
mediately finds the global minimum variance estimator. In the next iteration, all nodes w set 
X^(2) = and the process essentially halts, since all nodes have the same estimator, X^{2) = 

X{oo) = Z{oo), with Var[X(cx))] = i. 

In general, in graphs of large maximal degree, Xy{t) converges to fi: 

Proposition 3.1. Let Gn = {Vn,En) be a family of graphs where \ Vn\ = n and maXy^Vn dy — > oo. 
Then 

lim sup lim E[{Xy{t) - fif] = 0. 

Proof. Since all estimators at all iterations have mean /i, then E[(X^(f) — //)^] = Var[Xt,(f)]. By 
lemma [2?3t the limiting variances of all the agents in a graph G„ are equal to some pn, and therefore 

lim sup lim E[{Xv{t) — /x)^] =0 ^ lim /3„ = 

n— >oo t,gy^ t— >oo n^oo 

The condition maXy^Vn ^ oo implies that given e > 0, there exists a high enough N, so 
that in any Gn with n > N there exists a node Wn with degree d^^ larger than 1/e. Then 
Var[Xt„^(l)] < e, since agent Wn would, on the first iteration, average the estimators of all its 
neighbors, resulting in a new estimator of variance l/dw„- Since variance never increases in the 
iterative process (lemma [2. 3p . then pn < e for n larger then some A^. Since this is true for arbitrary 
e, Pn goes to zero as n goes to infinity. □ 
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Figure 1: Interval of Length Four 
3.2 Interval of Length four 

We analyze the case of G = {V, E) where V = {a, 6, c, d} and E = {{a, 6}, {6, c}, {c, d}} . 
In Appendix [Al we prove that the process converges with 

Var[X(oo)] = 2- ^/3 = i(l + \/49 - ^48) >\= Var[Z(cx))], 

thus proving that a case exists where X[qo) ^ Z{oo). We also, for this case, derive an asymptotic 
convergence rate of 2 — \/3- 

The averaging estimator Y{oo) is: 

y(oo) = 0.2Xa + 0.3Xb + 0.3Xc + 0.2Xd, 

with 

Var[y(oo)] = 0.26. 

This is slightly lower than Var[X(cx))], which equals about 0.268. However, the convergence rate 
(second eigenvalue) for the averaging process is ^ + \^33/12 ~ 0.73, which is significantly slower 
than the minimum variance process's asymptotic rate of 2 — -v/3 ~ 0.268. 

4 Numerical Examples and Conjectures 

Numerical simulations on intervals of lengths larger than four suggest a surprising result. 
4.1 Interval of Arbitrary Length 

Numerical simulations suggest that X{oo), for intervals of length n, approaches a normal distribu- 
tion around the center of the interval, with variance proportional to n: 

Conjecture 4.1. For interval graphs of length 2n, index the agents by k £ {0, . . . , 2n — 1}. Then 

X{oo) = Y,AkXk 

k 

where approaches a normal distribution in the sense that 

lim y f Afc - c„e-('^-"+i/2)2/^W'\^ = with v{n) G e(n), C„ g M. 

k 

This implies that while lim„^oo E[(X(cxo) — ^)^] = 0, X(oo) quickly becomes less efficient when 
compared to Z(oo): 

Conjecture 4.2. Var[X(oo)] oc y^Var[Z(oo)] . 

Note that Var[y(cxD)], on the other hand, approaches Var[Z(oo)] as n increases, for intervals 
of length n. 
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4.2 Agents with Memory 

A model which is perhaps more natural than the memoryless model is the model in which the 
agents remember all their own values from the previous iterations. 

Proposition 4.3. // the agents have memory, then the process converges to Z{oo). 

Proof. Since each vertex v always remembers = Xy{0), then is always part of the set over 
which Xv{t) was constructed. Then, by Lemma l2.5t 

Cov(X,(t),X,) = Var[X„(t)], (9) 

and by Lemma l2.4t 

yv,w £V : lim Cov(X^(t), X^,) = lim Cov{Xy{t), X^) = lim Var[Xy{t)] = poo- 

t^QO t— too t^OO 

This means that for any agent w, the covariance of its limit estimator with each of the original 
estimators Xy is identical, and so it must be their average: X{oo) = Z{oo). □ 

This proof relied only on the agents' memory of their original estimators. Since they also gain 
more estimators over the iterations, and seemingly expand the space that they span, we conjecture 
that: 

Conjecture 4.4. \fveV: Xy{t) = X{oo), for t>\V\. 

5 Conclusion 

An number of interesting open problems can be raised with respect to this model, some of which 
we conjecture about above: 

• Does it always converge? We conjecture above that this is indeed the case. 

• For what graphs does it converge to the optimal estimator Z(oo)? 

• Otherwise, what is the "price of anarchy", Var[X(c)o)]/Var[Z(oo)]? Is it bounded? We 
conjecture above that it isn't. 

• What is the convergence rate? 
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A Analysis of Interval of Length Four 



Theorem A.l. Let G = {V,E) be the interval of length four where V 
{{a,b},{b,c},{c,d}} . Then X.^{t) converges to a limit X{oo), where 

X(00) = i [(1 - OiXa + Xd) + (1 + 0(^6 + ^c)] 

withi = 2-^/^= i(l + ^/49- \/48). 



{a, 6, c, d} and E 
Var[X(oo)] = e, 



Proof. We define Myyj{t) = Cav{Xy,Xyj[t))^ so that each column of M is the coordinates of 
an agent's estimator at time t, viewed as a vector in the space spanned by {Xa,X}j,Xc,Xd}. 
We define Z(oo)-subtracted M as M^wit) = Cov(Xt, — Z(oo),X^(t) — Z(oo)), where Z{co) = 
\{Xa + Xi, + Xc + Xd), and hkewise define the Z(oo)-subtracted covariance matrix Cvw{t) = 
Cov{X^{t) - Z{oo),X^{t) - Z(oo)). 

We now shift to an ahernative orthonormal basis B: 



B 



r /i/2\ 

1/2 
1/2 

I v/v 



(= bi), 



/-1/V2\ 



V 1/V2J 



b2), 



( 
\ 



/-l/2\ 
1/2 
1/2 

v-i/2y 



(= M 



The vector 61 (= 2Z(oo)) was chosen because its coordinate is one half in every unbiased estimator 
(and zero for any Z(oo)-subtracted unbiased estimator). ^2 and 63 are anti-symmetric to inversion 
of the interval, a transformation which should leave X{oo) invariant by the symmetry of the graph. 
Therefore we expect their coordinates in X(cxd) to vanish. We have no freedom, then, in choosing 
the last vector, and expect X{oq) to equal \bi plus some constant ^ times ^64: 

X(CX)) = \[{l- i){Xa + Xd) + (1 + + X,)] . 

Performing two iterations of the process reveals that under this basis, the Z(cxD)-subtracted 
coordinates matrix of the estimators at iteration two. My (2), is of the form: 



Mb(2) 



X 








z 

\y w 






— z 
w 



— X 



V I 



with 



yw = z + w . 

Application of another iteration yields a matrix of the same form: 



(10) 



Ms (3) 



/o 







—xz ^ 

+ — w)-^ 



-{y-w) 





xz ^ 



x'^ -\-(y~w)'^ 







—z 
w 



(11) 



with the relation of Eq. [10] preserved. 

Since the result is a matrix of essentially the same form, equivalent equations apply for consec- 
utive iterations, and we may denote as xt, yt, wt and zt the corresponding matrix entries at time 
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Since Eq. [TT] implies that yt = wt~i and xt = zt-i, then if wt and Zf converge then the process 
converges and X{oo) exists. Also: 

wt+i = — r^wt (12) 

and 

= — To^t- (13) 

Zt_i + [Wt-l - WtY 

Dividing Eq. [T2]by Eq. [T3t we discover that: 

Wt+l _ Zt-l Wt 
Zt+1 Zt Zt 

and therefore, by repeated application: 

W2Z2 Zt 1 Zt 



Wt 



Z3 Zt-l 2 Zt-l 

Eq. [TUlcan alternatively be written as: wt-iwt = Wf + z^. Then: 

wt-i -wt = zf/wt = 2ztzt-i, 

and we can write Eq. [13] as: 

Zt-lZt 

or 

i = £^(l + 4z2). (14) 

Zt+l Zt^ ^' 

To solve this recursion we make the following guess: 

Zt 



2 + ^3-4z/, (15) 

Zt+l ^ 

which is a solution of the following quadratic equation in zt/ zt+i'- 

^_4^ + l + 4z2 = o. 
zf+i zt+i 

This is equivalent to the following relation: 

Var[X;,(t)] = wl + z^ + \ = 2wt, 

upon which we serendipitously stumbled during our examination of this problem. 

This guess satisfies Eq. [HI as some manipulation of the two equations will show. Since Z2 and 
Z3 satisfy Eq. 1151 then the rest of the z's must, too. 

Since Eq. [T^ implies — > 0, we can conclude from = 2 + w3 — 4z2_^ that 



lim £^ = 2 - ^/3 := e 

i-^oo Zt 



Zt+1 
Zt 

This is the process's asymptotic convergence rate. Since wt = ^^f^j then wt — > and 

X(oo) = i [(1 - OiXa + Xd) + (1 + i){X, + X,)] , 



with 



1\2 fl-^ 



Var[X(oo)] = ( - 1 +1-^) =6 



□ 
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